Extracting knowledge from the World Wide Web
نویسندگان
چکیده
منابع مشابه
Extracting knowledge from the World Wide Web.
The World Wide Web provides a unprecedented opportunity to automatically analyze a large sample of interests and activity in the world. We discuss methods for extracting knowledge from the web by randomly sampling and analyzing hosts and pages, and by analyzing the link structure of the web and how links accumulate over time. A variety of interesting and valuable information can be extracted, s...
متن کاملExtracting Patterns and Relations from the World Wide Web
The World Wide Web is a vast resource for information. At the same time it is extremely distributed. A particular type of data such as restaurant lists may be scattered across thousands of independent information sources in many di erent formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique w...
متن کاملExtracting ontologies from World Wide Web via HTML tables
Minoru Yoshida, Kentaro Torisawa and Jun’ichi Tsujii 1 Department of Computer Science, Graduate school of Information Science and Technology, 2 School of Information Science, Japan Advanced Institute of Science and Technology 3 Information and Human Behavior, PRESTO, Japan Science and Technology Corporation CREST, JST(Japan Science and Technology Corporation) Postal address: Department of Compu...
متن کاملKnowledge Retrieval and the World Wide Web
L ARGE-SCALE WEB SEARCH engines effectively retrieve entire documents, but they are imprecise, because they do not exploit and hence retrieve the semantic Web document content. We cannot automatically extract such content from general documents yet. Manually structuring Web documents— for example, with XML—lets us retrieve more precise information using stringand structure-matching tools, such ...
متن کاملLearning to Extract Symbolic Knowledge from the World Wide Web
The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a computer understandable knowledge base whose content mirrors that of the World Wide Web. Such a knowledge base would enable much more e ective retrieval of Web information, and promote new uses of the Web to support k...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the National Academy of Sciences
سال: 2004
ISSN: 0027-8424,1091-6490
DOI: 10.1073/pnas.0307528100